Claims 

What is claimed is: 

1. A method of adapting a speech recognition system to speech data provided 
to the speech recognition system, the method comprising the steps of: 

computing alignment information between the speech recognition system 
and feature vectors associated with the speech data provided to the speech recognition 
system; 

computing an original spectra for each feature vector and corresponding 
mean vector; 

estimating one or more transformation parameters which maximize a 
likelihood of an utterance; and 

transforming a current feature vector using the estimated transformation 
parameters and maximum likelihood criteria, the transformation being performed in a 
linear spectral domain. 

2. The method of claim 1, wherein the step of transforming the current 
feature vector is performed in feature space. 

3. The method of claim 1, wherein the step of transforming the current 
feature vector is performed in model space. 

4. The method of claim 1, wherein the maximum likelihood criteria is a 
maximum likelihood spectral transformation (MLST). 

5. The method of claim 1, wherein the step of estimating one or more 
transformation parameters which maximize a likelihood of an utterance further comprises 
the step of computing likelihood of utterance information corresponding to a previous 
feature vector transformation. 



6. The method of claim 1, wherein the step of estimating the transformation 
parameters further comprises the step of estimating convolutional noise $N_i^{\alpha}$ and additive 
noise $N_i^{\beta}$ for each ith component of a speech vector corresponding to the speech data 
provided to the speech recognition system. 

7. The method of claim 6, wherein the step of estimating the transformation 
parameters further comprises the step of defining a diagonal matrix A with $A_{ii} = 1/N_i^{\alpha}$ and 
defining $b_i = -N_i^{\beta}/N_i^{\alpha}$. 



8. The method of claim 7, further comprising the steps of: 

determining $A_{ii}$ in accordance with an expression 

^" ~ T^y ^(e)2 y (a) y U) I 

determining $b_i$ in accordance with an expression 
bi = ; . 

where $x_i$ and $m_i$ are sub-linear spectral values of a feature vector and 
corresponding mean vector, respectively, for each ith component of the speech vector. 



9. The method of claim 1, wherein the step of transforming the current 
feature vector further comprises the step of determining $\hat{x}_i = \frac{1}{N_i^{\alpha}} x_i - \frac{N_i^{\beta}}{N_i^{\alpha}}$, where $x_i$ is an 
ith component of a speech vector corresponding to the speech data provided to the speech 
recognition system, $N_i^{\alpha}$ is convolutional noise and $N_i^{\beta}$ is additive noise of the ith 
component of the speech vector. 
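
By way of illustration only, the per-component mapping recited in claim 9 can be sketched as an affine operation with a diagonal matrix A and offset b, taking A_ii as the reciprocal of the convolutional noise and b_i as the negated ratio of additive to convolutional noise. The function name `mlst_transform` and its parameter names are hypothetical, the noise estimates are assumed to be supplied by the caller (their estimation is not shown), and the input is assumed to be already expressed in the linear spectral domain.

```python
from typing import List, Sequence


def mlst_transform(
    x: Sequence[float],
    conv_noise: Sequence[float],
    add_noise: Sequence[float],
) -> List[float]:
    """Illustrative sketch: x_hat_i = A_ii * x_i + b_i for each component i.

    A_ii = 1 / conv_noise[i] and b_i = -add_noise[i] / conv_noise[i].
    All names here are hypothetical and chosen for this example only.
    """
    if not (len(x) == len(conv_noise) == len(add_noise)):
        raise ValueError("all inputs must have the same number of components")

    transformed = []
    for x_i, n_conv, n_add in zip(x, conv_noise, add_noise):
        if n_conv == 0:
            raise ValueError("convolutional noise must be non-zero")
        a_ii = 1.0 / n_conv       # diagonal entry of A
        b_i = -n_add / n_conv     # offset term
        transformed.append(a_ii * x_i + b_i)
    return transformed
```

For example, `mlst_transform([4.0, 8.0], [2.0, 4.0], [1.0, 0.0])` applies A = diag(0.5, 0.25) and b = (-0.5, 0.0) to the two components. Because A is diagonal, each spectral component is handled independently of the others.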

10. The method of claim 1, wherein the step of computing alignment 
information is performed using a Baum-Welch algorithm. 

11. In a speech recognition system, a method of transforming speech feature 
vectors associated with speech data provided to the speech recognition system, the 
method comprising the steps of: 

receiving likelihood of utterance information corresponding to a previous 
feature vector transformation; 

estimating one or more transformation parameters based, at least in part, 
on the likelihood of utterance information corresponding to a previous feature vector 
transformation; and 

transforming a current feature vector based on at least one of maximum 
likelihood criteria and the estimated transformation parameters, the transformation being 
performed in a linear spectral domain. 

12. The method of claim 11, wherein the step of transforming the current 
feature vector is performed in feature space. 

13. The method of claim 11, wherein the step of transforming the current 
feature vector is performed in model space. 

14. The method of claim 11, wherein the maximum likelihood criteria is a 
maximum likelihood spectral transformation (MLST). 

15. Apparatus for adapting a speech recognition system to speech data 
provided to the speech recognition system, the apparatus comprising: 

at least one processing device operative to: (i) compute alignment 
information between the speech recognition system and feature vectors associated with 
the speech data provided to the speech recognition system; (ii) compute an original 
spectra for each feature vector and a corresponding mean vector; (iii) estimate one or 
more transformation parameters which maximize a likelihood of an utterance; and (iv) 
transform a current feature vector based on at least one of maximum likelihood criteria 
and the estimated transformation parameters, the transformation being performed in a 
linear spectral domain. 

16. The apparatus of claim 15, wherein the operation of transforming the 
current feature vector is performed in a feature space. 

17. The apparatus of claim 15, wherein the operation of transforming the 
current feature vector is performed in a model space. 

18. The apparatus of claim 15, wherein the spectral transformation employed 
in the operation of transforming the current feature vector is a maximum likelihood 
spectral transformation (MLST). 

19. The apparatus of claim 15, wherein the operation of estimating one or 
more transformation parameters which maximize a likelihood of an utterance further 
comprises the operation of computing likelihood of utterance information corresponding 
to a previous feature vector transformation. 

20. The apparatus of claim 15, wherein the operation of estimating the 
transformation parameters further includes the operation of estimating convolutional 
noise $N_i^{\alpha}$ and additive noise $N_i^{\beta}$ for each ith component of a speech vector provided to the 
speech recognition system. 

21. The apparatus of claim 20, wherein the operation of estimating the 
transformation parameters further includes the operation of defining a diagonal matrix A 
with $A_{ii} = 1/N_i^{\alpha}$, and defining $b_i = -N_i^{\beta}/N_i^{\alpha}$. 

22. The apparatus of claim 21, wherein the operation of estimating the 
transformation parameters further comprises the operation of: 

determining $A_{ii}$ in accordance with an expression 

tt x^^>2 V j5)y At) ; ^d 

determining $b_i$ in accordance with an expression 

Oi— J ; 

where $x_i$ and $m_i$ are sub-linear spectral values of a feature vector and 
corresponding mean vector, respectively, for each ith component of the speech vector. 

23. The apparatus of claim 15, wherein the operation of transforming the 
current feature vector includes the step of determining $\hat{x}_i = \frac{1}{N_i^{\alpha}} x_i - \frac{N_i^{\beta}}{N_i^{\alpha}}$, where $x_i$ is an 
ith component of a speech vector corresponding to the speech data provided to the speech 
recognition system, $N_i^{\alpha}$ is convolutional noise and $N_i^{\beta}$ is additive noise of the ith 
component of the speech vector. 